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The sequences of the peplomeric SI protein of four serologically distinct strains of the infectious bronchitis virus 
(IBV), an avian coronavirus, have been determined. The SI protein is thought to contain the serotype-specific neutral¬ 
ization epitopes and to be the main target of antigenic variation. An alignment with sequences of three strains published 
previously showed that from the 545 amino acid residues only 243 have been conserved. Clustering of substitutions 
suggests that most serotype determinants are located within the first 300 amino acid residues of SI. A phylogenetic 
tree of the SI sequences showed very variable rates of divergence. Differences in topology with a tree based on RNAse- 
T1 fingerprint data indicate that some of the IBV strains have arisen by genetic recombination. © 1 989 Academic Press, inc. 


Avian infectious bronchitis (IB) is a worldwide dis¬ 
ease, which is caused by a coronavirus and results in 
a highly contagious respiratory affliction of young 
chickens or in a decrease in egg production (35). Ini¬ 
tially, IB was effectively controlled by vaccination with 
live attenuated IB virus (IBV). However, this did not pre¬ 
vent outbreaks caused by variant viruses {10, 13). 
RNase T1 fingerprinting analysis showed that field 
strains isolated from such outbreaks are related to vac¬ 
cine strains (24), suggesting that the new strains have 
originated from vaccine virus by antigenic variation. As 
in other RNA viruses, antigenic variation is probably fa¬ 
cilitated by the relatively high error rate of transcription 
(10~ 3 ) during the transcription of the RNA template and 
the absence of a proofreading mechanism ( 18, 33). 

It is generally assumed that the serotype of IBV is 
determined by the glycoprotein E2, which is the struc¬ 
tural component of the peplomers, the typical club- 
shaped structures projecting from the surface of the 
virus. E2 is processed proteolytically to two noncova- 
lently bound peptide chains, Si and S2 (6, 34). S2 con¬ 
tains the C-terminal half of the sequence, including the 
transmembrane anchor and two long a-helices that 
form the stalk of the peplomer (14). Si forms the top 
part of the peplomer and presumably carries the sero- 
typical determinants. This assumption is based on the 
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findings that all strongly neutralizing monoclonal anti¬ 
bodies recognize Si, and that immunization with puri¬ 
fied SI, but not with virus lacking SI, induces neutraliz¬ 
ing antibodies (5, 7, 29). 

To investigate the serotypic variation of IBV at the 
molecular level, we determined the Si sequences of 
four IBV strains (13, 24): HI20, an attenuated vaccine 
strain of serotype A; D207, the reference strain of sero¬ 
type B; D1466, a vaccine strain of serotype C; and 
VI397, a recent Dutch isolate from serotype A/C. 
These sequences were compared with the sequence 
of three strains published previously (3, 4, 3): M41, a 
pathogenic strain from serotype A used in a killed-virus 
vaccine; M42, a nonpathogenic laboratory strain of se¬ 
rotype A; and 6/82, a recent British field isolate of sero¬ 
type B (10). 

Virus strains were obtained from the Poultry Health 
Institute, Doom, The Netherlands. Details on the isola¬ 
tion and passage history have been described (24). 
Strains were passaged once in the allantoic cavity of 
10-day-old chicken embryos. Virus stocks were stored 
at -70°. Virus growth, isolation of genomic RNA, cDNA 
synthesis, cloning, and sequencing were carried out 
essentially as described previously (37). With VI397, 
cDNA synthesis was primed using random calf thymus 
pentanucleotides (Pharmacia). By screening of colo¬ 
nies with probes containing Si sequences of the M41 
strain, Si clones of D207 and HI20 were detected. 
Partial sequencing of a D1466 clone with a large insert 
yielded Si sequences that could be used as a probe to 
obtain D1466 as well as VI397 clones. 

Most sequences were based on two or more inde¬ 
pendent cDNA clones. Only with strain D207, three nu- 
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Fig. 1. Amino acid sequences of IBV SI proteins. The sequence from M41 variant M41U has been listed completely; from other strains only 
the differences with this sequence are shown. The sequences from M41U, its cDNA variant M41U* (Ala on position 398, unpublished) and from 
the M42 variant M42S are from Niesters etaf. {31). The sequence of M42 variant M42H is from Binns et al. ( 3 ). The sequences of M41 variant 
M41H and of strain 6/82 are from Binns et al. (4). D207* represents a cDNA variant of strain D207 (Glu-48 as main variant was found by 
sequencing independent clones, Lys-117 by direct RNA sequencing). Dashes were introduced to align the sequences. Potential glycosylation 
sites (NXS or NXT, X A P) are underlined. Conserved cysteine residues are in boldface. 


cleotide differences between cDNA clones were ob¬ 
served, two of them leading to amino acid substitu¬ 
tions. During the sequence determination of strain 
M41, one difference between two cDNA clones was 
observed. From the nucleotide sequences (submitted 
to the EMBL, GenBank, and DDBJ nucleotide se¬ 


quence databases), amino acid sequences have been 
deduced. In Fig. 1, the amino acid sequences are listed 
together with the Si sequences published previously. 
As observed earlier { 31 ), sequence differences on a 
number of positions show that different laboratory vari¬ 
ants of a strain can exist; our HI20 sequence has 
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Fig. 1— Continued. 


five differences with M41 not reported by Cavanagh 
etal. (9). 

The length of the SI protein varies between 535 and 
538 amino acids, including the signal peptide and the 
arginine-rich cleavage site between Si and S2 (6). The 
sequences could be aligned by assuming deletions/in¬ 
sertions at 14 positions. Two SI proteins can have 
different amino acids in up to 49% of the positions of 
the sequence (Table 1). However, from the 17 to 19 


TABLE 1 


Difference Matrix of Si Sequences of IBV Strains® 



M41U 

M42S 

HI 20 

D207 

6/82 

D1466 

VI397 

M41 U (A) 


4.7 

3.9 

22.6 

22.4 

44.4 

44.4 

M42S (A) 

4.7 

— 

3.4 

21.6 

21.5 

45.0 

45.0 

HI 20 (A) 

3.9 

3.4 

— 

21.5 

21.5 

44.0 

44.0 

D207 (B) 

22.6 

21.6 

21.5 

— 

1.1 

48.8 

48.0 

6/82 (B) 

22.4 

21.5 

21.5 

1.1 

— 

48.4 

47.8 

D 1466(C) 

44.4 

45.0 

44.0 

48.8 

48.4 

— 

5.8 

VI397 (A/C) 

44.4 

45.0 

44.0 

48.0 

47.8 

5.8 

— 


a The figures represent the percentages of nonidentical amino 
acids. The designations of the strains are as in Fig. 1. From each 
strain, only one variant has been listed. 


cysteine residues, 16 have been completely con¬ 
served, as are most of the glycosylation sites. Presum¬ 
ably, the sequence variability is the combined result of 
the accumulation of neutral substitutions and the posi¬ 
tive selection of antigenic variants; the relative high fre¬ 
quency of nonsilent mutations in several parts of the S1 
sequence (not shown) suggests a positive selection. 

To localize the most variable regions, the number of 
different amino acid residues found on all 545 positions 
is plotted in Fig. 2. On 243 positions, the amino acid 
residues are conserved in all strains. Although there 
are no clearly defined hypervariable regions—as forex- 
ample, in the HIV envelope protein (30), the VP1 protein 
of foot-and mouth-disease virus (2, 1 7), and the rotavi¬ 
rus VP7 protein (16, 17 )—there are relatively many re¬ 
placements in the regions 50-170 and 250-310. Inser¬ 
tions/deletions were mainly found in the region 120— 
170. These observations suggest a localization of most 
of the serotypic and antigenic determinants in the N- 
terminal half of the SI subunit. 

A more accurate definition of the antigenic determi¬ 
nants may be derived from a comparison of similar se¬ 
quences of strains with different serological properties. 
It has been suggested (9, 31) that in serotype-A strain 
the clustered substitutions in two regions, HVR 1 (56- 
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Fig. 2. Number of different amino acid residues per position in the sequences listed in Fig. 1. 


69 in the numbering of Fig. 1) and HVR 2 (117-133) 
coincide with neutralization epitopes. Indeed, a muta¬ 
tion in HVR 1 prevented neutralization by two different 
monoclonal antibodies (9). From the six differences be¬ 
tween the serotype-B strains D207 and H6/82, three 
are in a region corresponding to HVR 2. 

The epitopes of IBV recognized by neutralizing 
monoclonal antibodies against SI are conformation 
dependent (unpublished data). As described else¬ 
where {25; Kusters et at., unpublished results), the 30 
N-terminal residues of the S2 subunit contain several 
overlapping conformation-independent epitopes that 


evoke a weak neutralizing response. Since these epi¬ 
topes cross-react with antisera against different sero¬ 
types, they are not relevant forthe serotype of the virus. 

An alignment of the SI nucleotide sequences was 
used to calculate a distance matrix, from which the 
most likely phylogenetic tree was inferred by a program 
distributed as part of the PHYLIP package 2.6(75). The 
topology and branch lengths of this tree, shown in Fig. 
3A, were not affected by shuffling the order of se¬ 
quences. Intriguingly, there are differences in topology 
between this tree and a tree based on RNase-T 1 fin¬ 
gerprints (Fig. 3B). First, M41, M42, and HI20 are 
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Fig. 3. Phylogenetic relationship between IBV strains. (A) Tree of Si sequences. The percentage of nucleotide relatedness was calculated 
forthe strains listed in Table 1. The figures indicate percentages of replaced nucleotides. The tree is unrooted, i.e., the position of the hypotheti¬ 
cal ancestral IBV sequence is unknown. (B) Tree representing genomic relatedness. Percentages are estimated from RNase-TI fingerprint 
analysis (/). The serotype is indicated between parentheses. The data are from Kusters ef al. {24), except for M42 (12; J.G.K., unpublished). 
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placed apart in the T1 tree (<95% identity) but have 
closely related SI sequences >97.5% nucleotide 
identity). Second, the considerable divergence of the 
SI sequences of strains D1466 and D207 is not re¬ 
flected in the T1 tree, nor in a tree (not shown) based 
on sequences of the El genes (8). Third, VI397 is re¬ 
lated to HI 20 in the T1 tree >99.5% overall sequence 
similarity), but to D1466 in the SI tree. 

Theoretically, the first two of these discrepancies 
might be resolved by assuming extreme variations of 
the evolutionary rate within the viral genome. However, 
this would not explain the similar SI sequences of 
VI397 and D1466 vs the common RNase-TI spots of 
VI397 and Hi 20. Instead, we propose that genetic re¬ 
combination has played a role in the generation of anti¬ 
genic variants. For instance, VI397 may have acquired 
a D1466-like peplomer gene. In the murine coronavirus 
MHV, recombination occurs at a rather high frequency 
[19, 20, 26) and may alter the serotype of the virus {27). 
For IBV, conditions that favor recombination are cre¬ 
ated in the field by vaccination of chickens with more 
than one attenuated IBV strain. Thus, infections of cells 
with two different strains, leading to the formation of 
recombinants, may very well have occurred. 

RNA recombination has been well documented for 
picornaviruses {21-23, 28, 32). It would be interesting 
to test our hypothesis that recombination also plays a 
role in the generation of new IBV variants. Such a test 
might be based on the localization of the recombina¬ 
tion site or on in vitro recombination experiments. 
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